LLM Comparison | Blog | SUZUKI SHOTEN

LLM Comparison

1 article tagged with "LLM Comparison"

Part 2. OpenClaw Code Generation — Gemini Flash's Failure and Switching to Claude Delegation

When I let OpenClaw's default setup (Gemini 3 Flash) write the Python rule-check code, the AI ended up hardcoding the expected answers after 25 rewrites. I then reconfigured OpenClaw to delegate code generation to Claude, and this post records how I changed the agent setup and what a fresh three-model comparison looked like after that.

2026-04-15 Read →